Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

#407 added option to reduce flushing of streams during write operation. #432

Merged
merged 2 commits into from
Dec 22, 2023

Conversation

dxdjgl
Copy link

@dxdjgl dxdjgl commented Nov 19, 2023

No description provided.

/// <summary>
/// Reduce flushing of stream during write, to increase performance of for instance Azure blob
/// </summary>
public bool ReduceStreamFlushing { get; set; }
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it might be better to reduce flushing by default and have no option - less permutations to support.

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i took the liberty of doing so, hope you don't mind

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry but I dont agree, since it was found out that removing that flush will make parquet.net hang in this method call
public Task WriteColumnAsync(DataColumn column, CancellationToken cancellationToken = default)
in some schenarios. So I still believe the best solution for now is to have the flush handling configurable, until a permanent fix for the stuck Task has been found.

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry missed this issue. I'll add it back now and release a patch. Is there any way we can reproduce the failure with a unit test btw?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please have a look at the code I sent in a mail to you.

@aloneguid aloneguid modified the milestones: 4.19.0, 4.18.0 Dec 22, 2023
@aloneguid aloneguid merged commit 0f46d34 into aloneguid:master Dec 22, 2023
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG]: Performance decrease since > 4.11 or higher when writing into Azure blob stream
2 participants